Algorithms for Processing of Spatial Queries using R-trees. The Closest Pairs Query and its Application on Spatial Databases

نویسندگان

  • ANTONIO LEOPOLDO
  • CORRAL LIRIA
  • Yannis Manolopoulos
  • José Samos
چکیده

This thesis addresses the problem of finding the K closest pairs between two spatial datasets (the so-called, K Closest Pairs Query, K-CPQ), where each set is stored in an index structure belonging in the R-tree family. There are two different techniques for solving this kind of distance-based query. The first is the incremental approach [HjS98, SML00], which computes the operation in the sense that the result elements are reported one-by-one in ascending order of distance. The second is the non-incremental alternative that we present in this thesis, which computes the operation reporting the K elements of the result all together at the end of the algorithm. Branch-and-bound has been the most successful technique for the design of algorithms that obtain the result of queries over tree-like structures. In this thesis, a general branch-and-bound algorithmic schema for obtaining the K optimal solutions of a given problem is proposed. Moreover, based on distance functions between two MBRs in the multidimensional Euclidean space, we propose a pruning heuristic and two updating strategies for minimizing the pruning distance, in order to use them in the design of three nonincremental branch-and-bound algorithms for K-CPQ between spatial objects stored in two Rtrees. Two of those approaches are recursive, following a Depth-First searching strategy and one is iterative, obeying a Best-First traversal policy. The plane-sweep method and the search ordering are used as optimization techniques for improving the naive approaches. Besides, a number of interesting extensions of the K-CPQ (K-Self-CPQ, Semi-CPQ, K-FPQ (the K Farthest Pairs Query), etc.) are discussed. An extensive performance study is also presented. This study is based on experiments performed with real spatial datasets. A wide range of values for the basic parameters affecting the performance of the algorithms is examined. The outcome of these studies is the designation of the algorithm that wins the performance award for each setting of parameter values. An experimental study of the behavior of the proposed K-CPQ branch-and-bound algorithms in terms of scalability of the dataset sizes and the value of K is also included. Finally, our non-incremental algorithms have been adapted to perform the simulation of the incremental closest pairs query.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Cost models for distance joins queries using R-trees

The K-Closest-Pairs Query (K-CPQ), a type of distance join in spatial databases, discovers the K pairs of objects formed from two different datasets with the K smallest distances. Recently, branch-and-bound algorithms based on R-trees have been developed in order to answer K-CPQs efficiently. For query optimization purposes, analytical models are needed to estimate the processing cost of a spec...

متن کامل

Processing Distance Join Queries with Constraints

Distance-join queries are used in many modern applications, such as spatial databases, spatiotemporal databases, and data mining. One of the most common distance-join queries is the closest-pair query. Given two datasets DA and DB the closest-pair query (CPQ) retrieves the pair (a,b), where a ∈ DA and b ∈ DB, having the smallest distance between all pairs of objects. An extension to this proble...

متن کامل

A performance comparison of distance-based query algorithms using R-trees in spatial databases

Efficient processing of distance-based queries (DBQs) is of great importance in spatial databases due to the wide area of applications that may address such queries. The most representative and known DBQs are the K Nearest Neighbors Query (KNNQ), q Distance Range Query (qDRQ), K Closest Pairs Query (KCPQ) and q Distance Join Query (qDJQ). In this paper, we propose new pruning mechanism to apply...

متن کامل

New plane-sweep algorithms for distance-based join queries in spatial databases

Efficient and effective processing of the distance-based join query (DJQ) is of great importance in spatial databases due to the wide area of applications that may address such queries (mapping, urban planning, transportation planning, resource management, etc.). The most representative and studied DJQs are the K Closest Pairs Query (KCPQ) and εDistance Join Query (εDJQ). These spatial queries ...

متن کامل

Efficient k Nearest Neighbor Queries on Remote Spatial Databases Using Range Estimation (Draft Version)

K-Nearest Neighbor (k-NN) queries are used in GIS and CAD/CAM applications to find the k spatial objects closest to some given query points. Most previous k-NN research has assumed that the spatial databases to be queried are local, and that the query processing algorithms have direct access to their spatial indices, e.g. R-trees. Clearly, this assumption does not hold when k-NN queries are dir...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002